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Abstract The paper is concerned with the coherent quantum Linear Quadratic Gaussian 
(CQLQG) control problem for time-varying quantum plants governed by linear quantum 
stochastic differential equations over a bounded time interval. A controller is sought among 
quantum linear systems satisfying physical realizability (PR) conditions. The latter describe 
the dynamic equivalence of the system to an open quantum harmonic oscillator and relate 
its state-space matrices to the free Hamiltonian, coupling and scattering operators of the os- 
cillator. Using the Hamiltonian parameterization of PR controllers, the CQLQG problem is 
recast into an optimal control problem for a deterministic system governed by a differential 
Lyapunov equation. The state of this subsidiary system is the symmetric part of the quantum 
covariance matrix of the plant-controller state vector. The resulting covariance control prob- 
lem is treated using dynamic programming and Pontryagin's minimum principle. The asso- 
ciated Hamilton- Jacobi-Bellman equation for the minimum cost function involves Frechet 
differentiation with respect to matrix- valued variables. The gain matrices of the CQLQG 
optimal controller are shown to satisfy a quasi-separation property as a weaker quantum 
counterpart of the filtering/control decomposition of classical LQG controllers. 
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1 Introduction 

Quantum feedback control, which deals with dynamical systems whose variables are non- 
commutative linear operators on a Hilbert space governed by the laws of quantum mechanics 
(see, for example, [24]), involves two major paradigms. One of them employs classical in- 
formation on the quantum mechanical system retrieved through a macroscopic measuring 
device and thus accompanied by decoherence and the loss of quantum information, as re- 
flected, for example, in the projection postulate [6]. The other, less "invasive", approach, 
apparently practiced by nature to stabilize matter on an atomic scale, is through direct inter- 
action of quantum mechanical systems, possibly mediated by light fields. With the advances 
in quantum optics and nanotechnology making it possible to manipulate such interconnec- 
tion, measurement-free coherent quantum controllers provide an important alternative to the 
classical observation-actuation control loop. 

Coherent quantum feedback can be implemented, for example, using quantum-optical 
components, such as optical cavities, beam splitters, phase shifters, and modelled by linear 
quantum stochastic differential equations (QSDEs) [7, 11, 13] corresponding to open quan- 
tum harmonic oscillators [2,4]. The associated notion of physical realizability (PR) [8, 14, 
16] reflects the dynamic equivalence of a system to an open quantum harmonic oscillator. 
Being organized as quadratic constraints on the state-space matrices, the PR conditions im- 
posed on the controller complicate the solution of quantum analogues to the classical Linear 
Quadratic Gaussian (LQG) and "Hoo-control problems, and it is particularly so in regard to 
the Coherent Quantum LQG (CQLQG) problem [12] which has yet to be solved. 

The CQLQG control problem seeks a PR quantum controller to minimize the aver- 
age output "energy" of the closed-loop system, described by the quantum expectation of a 
quadratic form of its state variables. For the original infinite-horizon time-invariant setting 
of the problem, a numerical procedure was proposed in [12] to compute suboptimal con- 
trollers, and algebraic equations were obtained in [22] for the optimal CQLQG controller. 
Subtle coupling of the equations, which comes from the PR constraints, is apparently re- 
lated to the complicated nature of the suboptimal control design procedure. This suggests 
that an alternative viewpoint needs to be taken for a better understanding of the structure of 
the optimal quantum controller. 

In the present paper, the CQLQG control problem is approached by considering its time- 
varying version, with a PR quantum controller being sought to minimize the average output 
energy of the closed-loop system over a bounded time interval. We outline a dynamic pro- 
gramming approach to the finite-horizon time-varying CQLQG problem by recasting it as 
a deterministic optimal control problem for a dynamical system governed by a differential 
Lyapunov equation. The state of the subsidiary system is the symmetric part of the quantum 
covariance matrix of the plant-controller state vector. The role of control in this covariance 
control [18] problem is played by a triple of matrices from the Hamiltonian parameterization 
of a PR controller which relates its state-space matrices to the free Hamiltonian, coupling 
and scattering operators of an open quantum harmonic oscillator [2]. 

The dynamic programming approach to the covariance control problem is developed 
in conjunction with Pontryagin's minimum principle [15]. The appropriate costate of the 
subsidiary dynamical system is shown to coincide with the observability Gramian of the 
underlying closed-loop system. The resulting Hamilton-Jacobi-Bellman equation (HJBE) 
for the minimum cost function of the closed-loop system state covariance matrix involves 
Frechet differentiation in noncommutative matrix-valued variables. Such partial differential 
equations (PDEs) were considered, for example, in [20] in a different context of entropy 
variational problems for Gaussian diffusion processes. 
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Using the invariance of PR quantum controllers under the group of symplectic equiv- 
alence transformations of the state-space matrices (which is a salient feature of such con- 
trollers), we establish symplectic invariance of the minimum cost function. This reduces the 
minimization of Pontryagin's control Hamiltonian [19] to two independent quadratic opti- 
mization problems which yield the gain matrices of the optimal CQLQG controller. As in 
the time-invariant case [22], this partial decoupling is a weaker quantum counterpart of the 
filtering/control separation principle of classical LQG controllers [9]. The equations for the 
optimal quantum controller involve the inverse of special self-adjoint operators on matrices 
[22], which can be carried out through the matrix vectorization [10, 18]. 

The paper is organised as follows. Sections 2 and 3 specify the quantum plants and co- 
herent quantum controllers being considered. Section 4 revisits PR conditions to make the 
exposition self-contained. Section 5 formulates the CQLQG control problem. Section 6 de- 
rives a PDE for the minimum cost function from the symplectic invariance of PR controllers. 
This PDE is solved in Appendix A to provide an insight into the structure of the function. 
Section 7 establishes the HJBE for the minimum cost function and identifies the costate in 
the related Pontryagin's minimum principle as the observability Gramian. Section 8 carries 
out the minimization involved in the HJBE and obtains equations for the optimal controller 
gain matrices, using the special linear operators from Appendix B. Section 9 summarizes 
the system of equations for the optimal CQLQG controller. Section 10 provides concluding 
remarks and outlines further research. 



2 Open quantum plant 

The quantum plant considered below is an open quantum system which is coupled to another 
such system (playing the role of a controller), with the dynamics of both systems affected 
by the environment. At any time t, the plant is described by a n-dimensional vector xt of 
self-adjoint operators on a Hilbert space, with n even. The plant state vector x t evolves 
in time and contributes to a p\ -dimensional output of the plant yt (also with self-adjoint 
operator-valued entries) according to QSDEs 

dx t = A t x t dt + B t dw t + Etdrjt, (1) 
dyt = C t x t dt + D t dw t . (2) 

Here, the matrices A t € R nx ™, B t € R" xmi , C t € R PlX ", D t € R PlXmi , E t € K" xp2 
are known deterministic functions of time, which are assumed to be continuous for well- 
posedness of the QSDEs, 

zt := Ctx t (3) 

is the "signal part" of the plant output yt, and r\t is the output of the controller to be described 
in Section 3. The noise from the environment is represented by an mi -dimensional quantum 
Wiener process w t (with mi even) on the boson Fock space [13] with a canonical Ito table 

dw t dw t T = (7 mi + iJ 1 /2)dt. (4) 

Here, i is the imaginary unit, I m is the identity matrix of order m (with the subscript some- 
times omitted), and J\ is a real antisymmetric matrix, which is given by 

J\ ■= ® J, J = (5) 
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(with (g) the Kronecker product of matrices, and m :— mi/2) and specifies the canonical 
commutation relations (CCRs) for the quantum noise of the plant as 

T T T T 

[dwt,dw t ] := du>tdto t — (dwtdw t ) = ij\&t. (6) 

Vectors are assumed to be organized as columns unless indicated otherwise, and the trans- 
pose (-) T acts on vectors and matrices with operator-valued entries as if the latter were 
scalars. Accordingly, the (j, fc)th entry of the matrix [W, W T ], associated with a vector W 
of operators W\, . . . , W r , is the commutator 

[Wj,W k ] :=WjW k -W k Wj. 

Also, (-)t := ((-)*) T denotes the transpose of the entry-wise adjoint (•)^ £ . In application to 
ordinary matrices, (-)t is the complex conjugate transpose ((-)) T an d will be written as (•)*. 



3 Coherent quantum controller 



A measurement-free coherent quantum controller is another quantum system with a n- 
dimensional state vector £t with self-adjoint operator-valued entries whose interconnection 
with the plant (l)-(3) is described by the QSDEs 

d£t = <Httdt + bt<lut + etdyt, (7) 
drjt = c t £,tdt + d t dtjf (8) 

Here, a t £ R" x ™, h £ R" xm2 , c t G K P2X ", d t £ R P2Xm2 , e t £ R" xpi are deterministic 
continuous functions of time, and, similarly to (3), the process 

Ct := ctit (9) 

is the signal part of the controller output rjt- The process ojt in (7) and (8) is the controller 
noise which is assumed to be an mi -dimensional quantum Wiener process (with mi even) 
that commutes with the plant noise w t in (1) and (2) and also has a canonical Ito table 
dujtdujj = (I m2 + iJi/tydt with the CCR matrix 

J 2 := I»2 ® J, (10) 

where ^ 2 '~ mi/2. In view of (7), the matrices b t and e t will be referred to as the controller 
noise and observation gain matrices, even though yt is not an observation signal in the classi- 
cal control theoretic sense. The combined set of equations (l)-(3) and (7)-(9) describes the 
fully quantum closed-loop system shown in Fig. 1. The process ( t in (9) is analogous to the 









plant 




contr. 





yt 



Fig. 1 The plant and controller form a closed-loop quantum system described by (l)-(3) and (7)-(9), which 
is influenced by the environment through the quantum Wiener processes w t and u>t . 
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actuator signal in classical control theory. Following the classical approach, the performance 
of the quantum controller is described in terms of an r -dimensional process 



Zt = F t x t + GtCt 



(11) 



Its entries are linear combinations of the plant state and "actuator output" variables whose 
relative importance is specified by the matrices Ft G R rXTl and Gt € R rxm2 which are 
known continuous deterministic functions of time t. The weighting matrices F t , Gt are free 
from physical constraints and their choice is dictated by the control design preferences. The 
2n-dimensional combined state vector 



Xt 



of the closed-loop system and the "output" Z t in (1 1) are governed by the QSDEs 

dX t = A t X t dt + BtdWt, Z t = C t X t , 
driven by the combined (mi + 7712) -dimensional quantum Wiener process 



(12) 



with a block diagonal Ito table 



dW f dW t T = {I mi +m 2 + iJ/2)dt, J := I^+^ 2 ® J 



(13) 



in conformance with (4)-(6), (10). The state-space matrices of the closed-loop system (12) 
are given by 



(14) 



'At 


Bt 








" At 


E t c t 


B t E t dt 


etC t 


at 


e t D t b t 


Ft 


Gtc t 






4 Physical readability conditions 

The CCRs for the closed-loop system state vector are described by a real antisymmetric 
matrix 

©t := -i[X u X?\ = 21mE(XtXt~), (15) 

which, up to a factor of 2, coincides with the entrywise imaginary part of the quantum 
covariance matrix of Xt, with E(-) the quantum expectation (associated, in what follows, 
with the vacuum state). The matrix Ot evolves in time according to a differential Lyapunov 
equation 

Gt = A t Ot + OtAj + B t JB? , (16) 



where J is the CCR matrix of the combined quantum Wiener process Wt from (13) in the 
sense that [dWt, dWj] = iJdt. Indeed, by employing the ideas of [8, Proof of Theorem 2.1 
on pp. 1798-1799] and combining the quantum Ito formula with the bilinearity of the com- 
mutator as 

d[X, Y] = [dX, Y] + [X, dY] + [dX, dY], 
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and using the adaptedness of the state process X t and the quantum Ito product rules [13], it 
follows that 

d[X t ,X?] =[AtXtdt + BtdWt, Xl] + [X t , X?A?dt + AwJbJ] 
+ [AtXtdt + BtdWt, X?Ajdt + dwjBj] 
=(A t [X t , Xj\ + [X t , X?]Aj)dt + Bt[dW t , dWj]Bj, 

which, upon division by i, yields (16). Now, suppose the initial plant and controller state 
vectors commute with each other, so that [xq, ($] = and the matrix (15) is initialized by 



6>o 



K x 
K 2 



(17) 



where K\, K 2 are nonsingular real antisymmetric matrices. Then the CCR matrix of X t is 
preserved in time if and only if 

At&o + 0oAj + BtJBj = (18) 

for any t > 0. The left-hand side of (18) is always an antisymmetric matrix. Hence, by 
computing two diagonal and one off-diagonal blocks of this matrix with the aid of (14), it 
follows that the CCR preservation is equivalent to three equations 

A t K t + K^Aj + BtJ\Bj + E t d t J 2 djEj = 0, (19) 
E t (c t K 2 + d t J 2 bJ) + {KxCj + BtJiDj) ej = 0, (20) 

V v ' S v ' 

controller plant 

a t K 2 + K 2 aJ + etD^DjeJ + b t J 2 bJ = (21) 

to be satisfied at any time t. Therefore, the fulfillment of the equalities 

c t K 2 + d t J 2 bJ = 0, (22) 
CtKi+DtJiB? = 0, (23) 

which pertain to the controller and the plant, respectively, is sufficient for (20). Note that (21) 
and (22) are the conditions of physical realizability (PR) [8, 12] of the quantum controller in 
the sense of equivalence of its input-output operator to an open quantum harmonic oscillator 
[4]. In a similar fashion, (19) and (23) are the PR conditions for the quantum plant. 

Lemma 1 Suppose the quantum plant satisfies the PR conditions (19), (23) and the matrix 
Et is of full column rank. Then the closed-loop system state CCR matrix 0q in (17) is 
preserved if and only if the controller satisfies the PR conditions (21), (22). 

Proof The "if" part of the lemma was considered above. The "only if" claim is established 
by left multiplying both sides of (20) by (Ej Ety 1 Ej . This is valid if E t is of full column 
rank and yields 

c t K 2 + dtJ 2 b T t = -(EjEt^Ej^Cj + BtJ^Dj). 



In this case, the fulfillment of the second PR condition for the plant (23) indeed entails the 
PR condition (22) for the controller. ■ 
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The fact that the CCR preservation property for the closed-loop system state alone "cov- 
ers" the separate PR conditions for the plant and controller as input-output operators is ex- 
plained by the "internalization" of their outputs which become part of the state dynamics 
when the systems are coupled. In what follows, the controller state CCR matrix is assumed 
to be given by 

K 2 = I V ® J=: J , (24) 
where v := n/2, so that the controller PR conditions (21) and (22) take the form 

at Jo + JoaJ + e t D t JiDjeJ + btJ 2 bJ = 0, (25) 
c t Jo + d t J 2 bJ = 0. (26) 

The first PR condition (25) is a linear equation with respect to a t whose solutions are pa- 
rameterized by real symmetric matrices R t of order n as 

at = {etDtJiDjeJ + b t J 2 bJ)J /2 +J R t . (27) 

v V ' 

at 

These solutions form an affine subspace in R™ x " obtained by translating the linear subspace 
of Hamiltonian matrices 

{a e M. nxn : a J + J a T = 0} = J §„ = S„J 

by a skew-Hamiltonian matrix at, that is, a particular solution of (25) which is a quadratic 
function of bt and et. Here, §„ denotes the subspace of real symmetric matrices of order n, 
and Rt £ S„ specifies the free Hamiltonian operator (J Rt£t/'2 of the quantum harmonic 
oscillator [2, Eqs. (20)-(22) on pp. 8-9]. Note that (27) is the orthogonal decomposition of 
at into projections onto the subspaces of skew-Hamiltonian and Hamiltonian matrices in the 
sense of the Frobenius inner product 

(X,Y) := Tr(X T Y). 

Since the canonical structure of J in (24) implies that J^ 1 = — J , the second PR condition 
(26) allows the matrix c t to be expressed in terms of bt as 

c t = dtJ 2 bJj . (28) 

The matrix dt, which quantifies the instantaneous gain of the controller output r/t with re- 
spect to the controller noise wt, is assumed to be fixed. Then (27), (28) completely parame- 
terize the state-space matrices of a PR controller by the matrix triple 

ut~(b t ,e t ,Rt), (29) 

which will be regarded as an element of the Hilbert space U := R nxm ^ x R" xpi x S„ with 
the inherited inner product {(&, e, R), (/3, e, p)) := (b, /3) + (e, e) + (7?, p). 
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5 Coherent quantum LQG control problem 

In extending the infinite-horizon time invariant case from [12, 22], the CQLQG control prob- 
lem is formulated as the minimization of the average output "energy" of the closed-loop 
system (12) over a bounded time interval [0, T]: 

,-T ,-T 
£ T ~ E(zJz t )At = (C?Ct,Pt)dt— >min, (30) 
Jo Jo 

where the minimum is taken over the maps t m> ut in (29) which parameterize the n- 
dimensional controllers (7)-(9) satisfying the PR conditions (25), (26). Here, Zj Z t is the 
sum of squared entries of the vector Zt from (11), which are self-adjoint quantum mechani- 
cal operators. Also, 

P t := ReE(X t X?) (31) 

is a real positive semi-definite symmetric matrix of order 2n (we denote the set of such 
matrices by S^~ n ) which is the entry wise real part of the quantum covariance matrix of the 
closed-loop system state vector Xt- The matrix Pt satisfies the differential Lyapunov equa- 
tion 

Pt = AtPt + PtAj + B t B? =: Ct, Ut (Pt)- (32) 

The affine operator Ct,u t is the infinitesimal generator of a two-parameter semi-group, 
which acts on and depends on the triple ut of current matrices of the PR controller 
from (29) through (14), (27), (28). If Pq were zero (which is forbidden by the positive semi- 
definiteness of the quantum covariance matrix E(XoXq) = Po+iOo/2 )p 0), then Pt would 
coincide with the controllability Gramian, over the time interval [0, t], of a classical linear 
time- varying system with the state-space realization triple (At, Bt,Ct) driven by a standard 
Wiener process. The fact that £ T in (30) is representable as the LQG cost of a classical sys- 
tem reduces the CQLQG problem to a constrained LQG control problem for an equivalent 
classical plant 





B t E t d t 


Et 


Ft 





G t 





I 





Ct 


D t 


. 



(33) 



driven by an (mi + 1712) -dimensional standard Wiener process, with the controller being 
noiseless. In accordance with the standard convention, the block structure of the state-space 
realization in (33) corresponds to partitioning the input into the noise and control, and the 
output into the to-be-controlled and observation signals. We will develop a dynamic pro- 
gramming approach to (30) as an optimal control problem for a subsidiary dynamical sys- 
tem with state Pt in (31) governed by the ODE (32) whose right-hand side is specified by 
the matrix triple ut from (29). With the time horizon T assumed to be fixed, the minimum 
cost function is defined by 

i-T 

V t (P):=miJ (cJc s ,Ps}ds (34) 

for any t £ [0, T] and P £ S^ n . Here, the infimum is taken over all admissible state- 
space matrices of the PR controller on the time interval [t,T], provided the initial symmetric 
covariance matrix of the closed-loop system state vector is ReE(Xt Xj) = P. 
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6 Symplectic invariance 

As in the time-invariant case [22], the PR conditions (25)-(26) are invariant with respect to 
the group of symplectic similarity transformations of the controller matrices 

-1 L , L -1 



at <-¥ crater 



bt >-> crbt, et i-> tret, c t m> c t a~ 



where a is an arbitrary (possibly, time-varying) symplectic matrix of order n in the sense 
that aJo<J T = Jo- This corresponds to the canonical state transformation £t n> a^u see 
also [17, Eqs. (12)— (14)]. Any such transformation of a PR controller leads to its equivalent 
state-space representation, with the matrix Rt transformed as Rt n> a~ T Rta^ 1 , where 
(•)~ T := ((■) _1 ) T . Hence, the minimum cost function Vt(P) in (34) is invariant under the 
corresponding group of transformations of the closed-loop system state covariance matrix 
P, that is, 



V t (SPS 1 ) = V t (P), 



S := 



I 
a 



(35) 



for any symplectic matrix a. Assuming that Vt (P) is Frechet differentiable in P, its sym- 
plectic invariance (35) can be described in differential terms. To formulate the lemma below, 
the matrix P G S f n is split into blocks as 



P := 



Pll 
P21 



P12 
P22 



[P.i P.2] f 



2 11 



—2n— 
Pi. 

P2,\ tr 



r 



(36) 



where P\\ is associated with the state variables of the plant, whilst P22 pertains to those 
of the controller. The §2n -valued Frechet derivative of the minimum cost function has an 
analogous partitioning 



Qt(P) :=d P V t (P) = 



' 9 Pll V t d Pl2 V t /2 

dp 21 v t /2 dp 22 v t _ 



(37) 



where the 1 /2-factor takes into account the symmetry of P. Note that Qt = since Vt = 

+ 

2n 

H t (P) ~ Qt(P)P, (38) 



in view of (34). Associated with V t is a map H t : S+„ ->■ R 2nx2n defined by 



which is also partitioned into blocks as in (36) except that the matrix Ht(P) is not necessar- 
ily symmetric. Also, 

H(N) := - J S(J N) = -S(JVJo) Jo = (N+ J N T J )/2 (39) 

denotes the orthogonal projection of a matrix N £ R nxn onto the subspace of Hamiltonian 
matrices, with S the symmetrizer defined by 

S(N) := (N + N T )/2. (40) 

Lemma 2 Suppose the minimum cost function Vt(P) in (34) is Frechet differentiable with 
respect to P E S^. Then it satisfies the PDE 

H(H t 22 (P)) = 0, (41) 

which means that the controller block of the matrix Ht(P) from (38) is a skew -Hamiltonian 
matrix. 
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Proof The transformations P h» SPS t in (35) form a Lie group whose tangent space can 
be identified with the subspace of Hamiltonian matrices r of order n. As the matrix expo- 
nential of a Hamiltonian matrix, a e := e £T is a symplectic matrix for any real e. Therefore, 

if Vt is smooth, then by differentiating the left-hand side of (35) with S E := ^ ® J as a 
composite function at e = 0, it follows that the resulting Lie derivative [5] of Vt vanishes: 

= d s V t (S s PSj)^_ o = 2 (dpV t , [°i]rP2.) 

= 2([0l]Q t P.2,T) =2(H(H t 22 (P)),r). (42) 

Here, the notations (36)-(38) are used. Since the relation (42) is valid for any Hamiltonian 
matrix r, then (41) follows. ■ 

The relation (41) is, in fact, a system of first order scalar homogeneous linear PDEs 
which are associated with n(n + l)/2 entries of a real symmetric matrix of order n. This 
system is underdetermined since, for a fixed Pn, the total number of independent scalar 
variables is n(3n + l)/2. As we will show in Appendix A, this system of PDEs satisfies 
the involutivity condition and is locally completely integrable by the Frobenius integration 
theorem [5]. Instead of "disassembling" (41) into scalar equations which would lead to the 
loss of the underlying algebraic structure, it can be treated as one PDE with noncommutative 
matrix-valued variables. Such PDEs were encountered, for example, in entropy variational 
problems for Gaussian diffusion processes [20]. The general solution of the PDE (41) is 
given below. 

Theorem 1 Suppose ft : St x R nxn — > R is a continuously Frechet differentiable function. 
Then the function 

V t {P) := ft{Pll,Pl2{P22 + Jo)P2l), (43) 
defined for P £ §£ n with P22 >- 0, satisfies the PDE (41). Moreover, (43) describes a 
general smooth solution of the PDE over any connected component of the set {P £ S^ n ■ 
det P12 / 0, P22 y 0}. 

The proof of Theorem 1 is given in Appendix A and employs ideas from the method 
of characteristics for conventional PDEs [3,23] in combination with a nonlinear change of 
variables. The theorem shows that, due to the symplectic invariance, which holds at any 
time t, the minimum cost function Vt(P) depends on the matrix P only through the special 
combinations of its blocks Pu, P12P22 P21, P12J0P21 which constitute a maximal set of 
nonconstant invariants of P with respect to the transformation group P m> SPS t described 
in (35). 

7 Hamilton- Jacobi-Bellman equation 

Assuming that the minimum cost function Vt(P) from (34) is continuously differentiable 
with respect to t and P in the sense of Frechet, the dynamic programming principle yields 
the HJBE 

dtVt(P) + inf n t (P, u, Q t (P)) = 0. (44) 

Here, the minimization is over the triple u := (b,e,R) of the current matrices b := bt, 
e := e t , R := R t of the PR controller, and the map Q t : S^ n — s> §2n is associated with Vt 
by (37). Also, 

n t (P,u,Q) := (CJC U P) + {Q,£t,u(P)) (45) 
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is the control Hamiltonian [19] of Pontryagin's minimum principle [15] applied to the 
CQLQG problem (30) as an optimal control problem for the dynamical system (32) with 
state P and control u. In view of (14), (27), (28) and (32), the matrix R e § n , which param- 
eterizes the free Hamiltonian operator of the PR controller, enters the control Hamiltonian 
IJt(P, b, e, R, Q) only through At and in an affine fashion. Moreover, by considering (45) 
with Q = Qt(P) as in (44), it follows that 

n t (P, b, e, R, Q t (P)) = n t (P, b, e, 0, Q t (P)) + 2 (Qt{P), 

= n t (p, b, e, o, Q t (P)) + 2 (h(hP(p)), j r) 

= n t (P,b,e,0,Q t (P)), (46) 

where we have used the notations (36)-(38) and Lemma 2. The /^-independence of the 
right-hand side of (46) reduces the minimization problem in (44) to 

inf n t (P,u,Q t (P))=mfn t (P,b,e,0,Q t (P)). (47) 

«GU b.e 

This does not mean, however, that R t = has to be satisfied for the optimal quantum 
controller. The optimization problem (47) is solved in Section 8. We will now show that the 
map Q t from (37), evaluated at an optimal trajectory of the system (32) and thus describing 
the costate of this system through the Pontryagin equations 

Pt = d Q n t , Q t = -d P n t , (48) 

coincides with the observability Gramian of the closed-loop system (14). Here, () is the 
total time derivative, and the partial Frechet derivatives of (45) are taken with respect to Q, 
P as independent S2«-valued variables. 

Lemma 3 Suppose the minimum cost function Vt(P) in (34) is twice continuously Frechet 
differentiable in t and P. Also, suppose there exist functions b^(P) and e^(P) which are 
Frechet differentiable in P and deliver the minimum in (47). Then the matrix Qt := Qt(Pt), 
obtained by evaluating the map (37) at an optimal trajectory P t ° of the system (32), satisfies 
the differential Lyapunov equation 

Qt = -AfQt - QUt - Cfct, (49) 

where At, C° are the corresponding state-space matrices of the closed-loop system with the 
optimal CQLQG controller. 

Proof The twice continuous differentiability of Vt (P) ensures the interchangeability of its 
partial derivatives in t and P, so that dpdtVt = dtdpVt = dtQt in view of (37). Hence, by 
substituting (47) into (44) and differentiating the HJBE with respect to P, it follows that 

dtQt + d P n t (P, bt, et, 0, Q t )/dP = 0. (50) 

Since the pair [bt, et) minimizes n t (P, b, e, Q t ) in (b, e) £ R nxm ^ x R nxpi (that is, over 
an open set), then it is a critical point of this function, where both d^IIt and d e Flt vanish. 
Therefore, 

dn t (p,bt,et,o,Qt)/dP=d P nt + (d P bt)Hd b n t ) + (dpet)Hden t ) + (d P Q t )\d Q n t ) 
=d P n t + (dpQt^(d Q n t ), (51) 
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where dpQt = dpVt is a well-defined self-adjoint operator on S2 n in view of (37) and the 
twice continuous Frechet differentiability of Vt- Now, from (32) and (45), it follows that 

d P IJ t = cJCt + (d P £t.u(P))\Q) = AtQ + QAt + cJCt. (52) 

Since (45) implies that dQlJ t = C t , u {P), then substitution of (51), (52) into (50) yields the 
PDE 

dtQt+At T Qt + QtAt+C? T C? + (d P Qt)Hc t .u t (P)) = 0. (53) 

For the matrix Ff governed by (32) with u t = {b%,e%,R t ), the matrix (d t Qt)(P?) + 
i.dpQt)\C t ,u t {Ff)) = dQ t (Ff)/dt is the total time derivative of Q%. Hence, (53) leads to 
(49). ■ 

The ODE (49), whose right-hand side coincides with —dpFIt in view of (52) and in con- 
formance with (48), is the differential Lyapunov equation which governs the observability 
Gramian Q° of the closed-loop system under the optimal CQLQG controller. It is solved 
backwards in time t ^ T with zero terminal condition = 0. This (or an alternative 
reasoning involving the monotonicity of Vt(P) in P), can be used to show that the map Qt 
given by (37), takes values in 8^~ n . Therefore, Ht{P) in (38) is a diagonalizable matrix with 
all real nonnegative eigenvalues which correspond to the squared Hankel singular values 
of the closed-loop system in view of the interpretation of Q t and P t as observability and 
controllability Gramians. We will refer to H t as the Hankelian of the closed-loop system. 



8 Optimal controller gain matrices 

Since the matrix Ct in (14) depends only on bt in view of (28), the minimization on the 
right-hand side of (47) can be represented as 



inf FI t {P, u, Q t ) = inf {{cJCt, P) + inf (Q t , C tAefi {P))) 



(54) 



which is a repeated minimization problem over the PR controller gain matrices b := bt and 
e := e t . Here, 

C tAet0 {P) = A t P + PAl+B t Bj (55) 

is the Lyapunov operator from (32) obtained by letting Rt = in (27) and substituting the 
remaining skew-Hamiltonian part at of the controller matrix at into (14), which yields 



At 



' A t Etc t 
etCt a t 



(56) 



In turn, the repeated minimization problem (54) can be decoupled into two independent 
problems as follows. In view of the structure of the matrices at and ct in (27) and (28), 
the matrix At from (56) is a quadratic function of the controller gain matrices b, e. The 
dependencies of At on b and e can be isolated as 



At = 



'At 


o" 










+ 





E t dtJ2b 1 J 
bJ 2 b T J /2 



+ 











eCt eD t JiDje T J /2 



(57) 
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where the matrix At is independent of both b and e, whilst At only depends on b and At 
only depends on e. In a similar vein, (14) and (28) imply that 



B t Bj = 



CtC t 



B t Bj + E t dtdjE? 1 



+ 



Etdtb 1 



bdjEj 



bb 1 



+ 







B t Dje T 



eD t B t L eDtDle 1 



>t T Ft 0" 


+ 








F?Gtd t J 2 b T .h 
■hb-hdt Gj F t Job^dJ Gj Gtd t J2b T Jo 



(58) 



(59) 



where r t °, A® are independent of both b and e, the matrices E t , A t only depend on b, whilst 
ri only depends on e. By substituting (57), (58) into (55) and combining the result with 
(59), the repeated minimization problem in (54) is indeed split into 



inf n t (P, u, Qt) ={At, P) + (Qt, 2A° t P + if) 

u 

+ mi((A t ,P) + (Qt,2AtP + ft)) 

b 

+ mi(Qt,2A t P + r t )) 



(60) 



Both minimization problems on the right-hand side of (60) are quadratic optimization prob- 
lems whose solutions are available in closed form and lead to the optimal values for the 
controller gain matrices b and e. The fact that these problems are independent describes a 
quasi-separation property of the gain matrices [22] and can be interpreted as a weaker quan- 
tum counterpart of the filtering/control separation principle of the classical LQG control. We 
will first consider the minimization with respect to the controller observation gain matrix e. 
From (57) and (58), it follows that 



(Qt,2AtP + r t ) = (Q 




2eC t eDt^Dje 1 



Jo 



P 







B t Dje T 



eD t Bj eD t Dje T 



= (2{H^CJ + QfBtDj) + Wlt(e), e>, 



where 



Tit 



IH^J^DtJ^J | Q?,D t D?l 



(61) 



(62) 



is a self-adjoint operator of grade two (see Appendix B) on R nxpi . Here, we have used 
the property that the matrix Hf 2 Jo is antisymmetric since the controller block H? 2 of the 
Hankelian (38) is skew-Hamiltonian in view of (41). If the operator dJlt is positive definite, 
then the quadratic function on the right-hand side of (61) achieves its minimum value 



aun(Qt,2AtP + rt) = —\\H 21 cJ + Q 21 BtDj" 2 



at a unique point 



et := -m^iH^Cj + Q?B t Dj) 



Here, for a positive definite self-adjoint operator £> on the Hilbert space 
standard Frobenius inner product (■,■), we denote by 



(63) 
(64) 

5X « with the 



||JV||o := v^vy; 
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the norm of a matrix N G M. pxq associated with the "weighted" Frobenius inner product 

(K,N) := (K,D(N)). 

The minimization in (60) with respect to the controller noise gain matrix b is performed in a 
similar fashion. It follows from (57)-(59) that 

(A t ,P) + {Q u 2A t P + f t ) 

=2(P 21 ,J bJ 2 djG?F t ) + (P 22 , J bJ 2 d?GlGtd t J 2 b T J ) 



+ {Qt, 



2E t d t J 2 b T J 
bJ 2 b T J 



P + 



E t d t b T 
bdjEj bb T 



where 



2(Q 21 E t d t + M(H} 2 ) T E t + P 21 F?G t )d t J 2 ) + <Jl t (b), b) , (65) 



<n t := IH? J , J 2 | Of, / | J P 22 Jo, J 2 dJ Gj G t d t J 2 J (66) 



is a self-adjoint operator of grade three (see Appendix B) on R nxm2 in view of the antisym- 
metry of H 22 J . If <Jt t is po 
achieves its minimum value 



metry of H 22 Jo- If *Jlt is positive definite, then the quadratic function of b, given by (65), 



mm((A t ,P} + {Qt,2AtP + r t )) 

b 

= -WQfEtdt + J {{Hl 2 ) T E t + P 21 F?G t )d t Jsll^-x (67) 

at a unique point 

bt := ~^ x {QfE t d t + M(H} 2 ) T E t + P 21 F?G t )d t J 2 ). (68) 
Finally, by substituting (63), (67) into (60) and using the representation 

(A° t , P) + (Q t , 2A° t P + if) = (F?F t , P n ) + 2(H}\A t ) + {q\\ B t Bj + E t d t djEj), 

which follows from (57)-(59), the HJBE (44) is reduced to the Hamilton-Jacobi equation 
(HJE) below. 

Theorem 2 Suppose the minimum cost function Vt(P) for the CQLQG problem, defined 
by (34), is continuously Frechet differentiate in t and P, and the associated self-adjoint 
operators Tit and Vlt in (62) and (66) are positive definite. Then the function Vt(P) satisfies 
the HJE 

n \ , r,/rrll A \ , mil n n T , t-, , ,T ,-,Tv 



d t Vt + (F^ Ft, Pn) + 2(H}\ A t ) + (Q\\ B t Bj + E t d t dJ E? 

■ ■ 2 



WQfEtdt + J ((H} 2 ) T E t + P 2 iF?G t )d t J 2 f 



\HtC t L +QtB t D t L \\i n - 1 =0, 



and the gain matrices b%, e\ of an optimal PR controller are computed according to (68) 
and (64). 
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By using [22, Lemma 5], it can be shown, that if the controller block Qf of the closed- 
loop system observability Gramian is nonsingular, the matrix Dt is of full row rank and 
r ((Qt 2 )~ 1 H? 2 Jo) < 1> w i tn r (') the spectral radius of a matrix, then both operators OTt and 
9Tt are positive definite. Since each of the matrices Qt (P) and P enters dJlt and 9Tt in (62) 
and (66) in a linear fashion, the dependence of bt and e\ on these matrices is linear-fractional 
and hence, smooth, provided $Bit y and 9Ti >- (such values of P form an open set). 
Therefore, if, in addition to the assumptions of Theorem 2, the minimum cost function Vt (P) 
is twice continuously Frechet differentiable with respect to P, then the optimal CQLQG 
controller gain matrices 6* and e% are continuously Frechet differentiable functions of P. 
If, furthermore, Vt is twice continuously Frechet differentiable in t and P, this ensures the 
applicability of Lemma 3, which utilizes the viewpoint of Pontryagin's minimum principle 
on the CQLQG problem. 



9 Equations for the optimal quantum controller 

The set of equations for the optimal CQLQG controller over the time interval t ^ 
T consists of two Lyapunov ODEs (32) and (49) for the controllability and observability 
Gramians Pt, Qt of the closed-loop system: 



Pt =A t Pt + PtAj + B t Bj , 



Qt 



AjQt 



QtAt 



■ CtCt , 



(69) 
(70) 



with the split boundary conditions P = P and Q T = 0, where P £ §^ n is a given matrix 
satisfying P + i€>o/2 ^= 0. According to (14), (27), (28), the closed-loop system matrices 
At, Bt, Ct are expressed in terms of the controller matrices bt, e*, Rt as 



At 



Bt := 
Ct := 



At 
etCt 

B t 
etD t b t 

[F t G t dtJ 2 bJJo} 



E t dtJ 2 b t L Jo 
(e t D t JiDjeJ + btJ 2 bJ)Jo/2 + J Rt 

E t dt 



(71) 

(72) 
(73) 



In turn, the optimal controller gain matrices bt, e t are completely specified by the Gramians 
Pt, Qt (which determine the closed-loop system Hankelian H t ) according to (64), (68) as 

e t :=-lH! 2 J a ,Dt.hDj \ Qf , D t Dj\~ l (H^cj + Q 21 B t Dj), (74) 
b t ~-lHpJ Q ,J 2 | Q 22 ,I | J Q P 22 Jo, J^TGTGtdtJ^- 1 

{QfEtdt + J ({H} 2 ) T Et + P 2 iF?G t )d t J 2 ), (75) 



where the inverses of the special self-adjoint operators can be represented through the vec- 
torization of matrices; see Appendix B. Therefore, the set of equations for the optimal 
CQLQG controller is a split boundary value problem for two Lyapunov ODEs (69), (70) 
which are nonlinearly coupled through the algebraic equations (71)-(75). The matrix R t , 
which affinely enters the right-hand side of these ODEs through the matrix At in (71), ap- 
pears to be a free parameter in the sense that an equation for its optimal value is missing and 
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the optimal controller gain matrices et and bt in (74) and (75) do not depend on the current 
value of Rt- Moreover, by using the identity 

Ht =QtPt + QtPt 

= - {AtQt + QtAt + cJc t )P t + Qt(A t P t + PtAj + B t B?) 
=[H t ,Aj] + QtBtBj - cJCtPt 

(see also [21, Appendix C]), it can be shown that the skew-Hamiltonian structure of H? 2 
in (41), trivially ensured at T = by the terminal condition Qx = 0, is preserved by 
the dynamics (69)-(75) for t < T regardless of the choice of Rt. However, the function 
[0, T] 3 1 1->- Rt 6 §„ is responsible for the fulfillment of the split boundary conditions. 

10 Concluding remarks 

We have considered a time-varying Coherent Quantum LQG control problem which seeks 
a physically realizable quantum controller to minimize the finite-horizon LQG cost, and 
outlined a novel approach towards its solution. Using the Hamiltonian parameterization of 
PR controllers, which relates them to open quantum harmonic oscillators, we have recast 
the CQLQG problem as a covariance control problem. Dynamic programming and Pon- 
tryagin's minimum principle have been applied to the resulting optimal control problem 
for a subsidiary deterministic dynamical system whose state is the symmetric part of the 
quantum covariance matrix of the closed-loop system state vector governed by a differential 
Lyapunov equation. It has been shown that the corresponding costate is the observability 
Gramian of the closed-loop system. By using the invariance of the minimum cost function 
under the group of symplectic similarity transformations of PR controllers, we have derived 
algebraic equations for the gain matrices of the optimal CQLQG controller and established 
their partial decoupling as a weaker quantum analogue of the classical LQG control/filtering 
separation principle. These equations express the optimal controller gain matrices in terms of 
the current observability and controllability Gramians of the closed-loop system thus lead- 
ing to a split boundary value problem for two nonlinearly coupled differential Lyapunov 
equations. The difficulty of solving this problem lies in the coupling of the differential equa- 
tions and mixed nature of the boundary conditions. However, the special structure of the 
minimum cost function, enforced by the symplectic invariance, suggests the possibility of 
reducing the order of these equations by nonlinear transformation of the blocks of the Grami- 
ans. Another resource yet to be explored is to consider the CQLQG problem for PR plants. 
The existence/uniqueness of solutions to the equations for the state-space realization ma- 
trices of the optimal CQLQG controller remains an open problem and so do their possible 
reduction and numerical implementation. These issues are a subject of current research and 
will be reported in subsequent publications. 
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A Proof of Theorem 1 

By omitting the dependence of the minimum cost function Vt(P) on t and Pn which are assumed to be 
fixed, and introducing the variables 

X:=Pi2 = P 2 T 1 , Y:=P 22 , (76) 

the PDE (41) takes the form 

H(M(V)) = 0. (77) 

Here, use is made of (36)-(40), and M is a linear differential operator which maps a Frechet differentiable 
function F x " X S„ 9 (X, Y) M- v(X, Y) G R to an R nx "-valued function M(v) defined on the same 
domain by 

M(v) := ^X T d x v + Yd Y v. (78) 

The next section verifies the involutivity of the PDE (77) as a system of scalar PDEs. Then we consider two 
particular solutions of this PDE in Section A.2 which allow its general solution to be obtained in Section A.3 
through a change-of-variables technique. 
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A.l Verification of involutivity 

We will now verify the fulfillment of the local complete integrability conditions for the PDE (77), which in 
view of (39) and (40) is equivalent to 

S(M(V) J ) = (79) 

whose left-hand side is a real symmetric matrix of order n. If the complete integrability holds, then the PDE 
has a n 2 -dimensional integral manifold which can be represented using an R nxn -valued function of the 
matrices X and Y. For any constant matrix Z g S n , let L z be a linear operator which maps a Frechet 
differentiable function l" x "x§„9 (X, Y) M> v(X, Y) G R to a function L z (v) : R nxn X §„ -> R 
defined by 

L z (v) = (Z,S(M(v)J ))=-(ZJ ,M(v)) = -A ZJo (v). (80) 

Here, M is given by (78), and Am is a linear differential operator, which is associated with a matrix N G 
1™ X ° and maps the function v to another function of X, Y as 

A N (v) :=(N,M(v)). (81) 

Thus, the operators L z , associated with symmetric matrices Z by (80), are the operators Am considered 
for Hamiltonian matrices N, although, in general, the matrix in (81) can be arbitrary. By the Frobenius 
integration theorem [5] (see also, [1, pp. 158-165]), the local complete integrability of the PDE (79) will be 
proved if we show that for any constant matrices Z\,Z^ G §n, there exists a matrix Z3 e S„, which is 
allowed to depend on X and Y and such that 

[L Zl ,L Z2 ](v) = L Z3 (v) (82) 

is satisfied for any twice continuously Frechet differentiable function v described above. The relation (82) is 
an inner product form of the involutivity condition for the PDE (79) regarded as a system of scalar PDEs, 
with the inner product used to represent linear combinations of the individual equations in a coordinate-free 
fashion. 

Lemma 4 For any constant matrices N\,N% G R nxn , the operators (81), considered on twice continu- 
ously Frechet differentiable test functions v, satisfy the commutation relation 

[Anx , An 2 ] = /1[jv 1 ,jv 2 ]/2- ( 83 ) 

Proof For any N G R n x n and any twice continuously Frechet differentiable function v : R n x " x§„ 1— > R, 
the function Afj(v) is continuously Frechet differentiable and its derivatives are computed as 

d x A N (v) =^(d x vN T +d x v(XN)) + d Y d x v{YN), 

dyA N {v) =^(Nd Y v + d Y vN T + d Y v(YN + N T Y) + d x d Y v(XN)), 

where the relation dxdyv = (d Y dxv)^ and self-adjointness of the linear operators d x v and d Y v are used. 
Hence, the composition of the differential operators (81), associated with Ni, N2 G R nxn , is computed as 

A Nl (A N2 (v))=\ (Xm , 8 X A N2 (v)) + (YNi , d Y A N2 (v)) 

= i(X7ViAT 2 , d x v) + i (NjYNi + N?YN 2 + YN1N2 + N^N^Y, d Y v) 

+ ^(XN 1 ,d x v(XN 2 )) + ^(XN 1 ,d Y d x v{YN 2 )) 

+ ^{YN 1 ,d x & Y v(XN 2 )) + jiYN-t + N?Y,d Y v(YN 2 + N^Y)). (84) 

Note that the part of the right-hand side of (84), which involves the second-order derivatives of v, is invariant 
under the transposition (Ni , JV 2 ) i-y (N2,N\), and so also is the matrix TVj YNi + YN2 . Therefore, 
the commutator of /l^ and A N2 takes the form 

[A Nl ,A N2 ](v) = ^(X[N 1 ,N2],d x v) + ^(Y[N 1 ,N2],d Y v) 

= ^{Wi,N2],M(v)) = A [NliN2]/2 (v), 
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which holds for twice continuously Frechet differentiable functions v, thus establishing (83). ■ 

In view of the Frobenius theorem mentioned above, Lemma 4 implies the local complete integrability of 
the PDE 

M(V) = 0, (85) 

whose left-hand side is given by (78). The solutions of (85) are also solutions of the PDE (77), but not visa 
versa. Since (85) is a system of n independent scalar PDEs, the integrability suggests that it has a n(n + 
l)/2-dimensional integral manifold which can be represented using an S n -valued map. Moreover, Lemma 4 
also establishes the involutivity for the PDE (77), with its n 2 -dimensional integral manifold representable by 
an R nxn -valued map, or a pair of maps with values in S„ and A n , where A n := §^ ' s the subspace of real 
antisymmetric matrices of order n, which is the orthogonal complement of the subspace §„ in the sense of 
the Frobenius inner product in R" x 11 . Indeed, 

[Z^Jo, Z2J0] = (Z1J0Z2 — Z2JqZi)Jo 

for any Z\,Z2 € S», in accordance with the fact that the commutator of Hamiltonian matrices is also a 
Hamiltonian matrix. Therefore, by applying (83) to the operators (80), it follows that the involutivity condition 
(82) holds with Z3 := (Z2J0Z1 — Z\JqZ2)/2 6 S n , which implies the local complete integrability for 
the PDE (77). 



A.2 Two particular solutions 

Lemma 5 Suppose f : S n — > R is a Frechet differentiable function. Then the function 

V(X, Y) := f(XY~ 1 X T ) (86) 

defined for X e R nxn and Y g S„ with detY ^ 0, satisfies the PDE (85). Moreover, (86) describes 
the general smooth solution of (85) on every connected component of the set {(X, Y) G BJ™ X ™ x E> n ■ 
det(XY) + 0}. 

Proof With the matrices X £ i" x " and Y e S n , where det F^O.we associate the matrices 

U — XY- 1 , W := XY- X X T . (87) 

The Frechet derivatives of W with respect to X and Y are expressed in terms of special linear operators of 
grade one (see Appendix B) and the matrix transpose operator T as 

d x W =lI,U T i + lU,IlT, dyW = -lU,U T i, (88) 

where the composition M o N of linear operators M and N is written briefly as MN . Indeed, the first 
variation of the matrix-valued map (X, Y) W in (87) is computed as 

8W = (6X)Y~ 1 X T + XY~ 1 (5X) T - XY' 1 (SY)Y' 1 X T 

= (8X)U T + U8X T -U(SY)U T , (89) 

which implies (88). The Frechet derivatives of the composite function V = f o W from (86) are 

d x v=(d x w)Hf) = (i/,(7i+rpy T , /])(/') =2fu, <m 

where /' is the S n -valued Frechet derivative of the function /, and the relations |ct, /3|t = |a T , /3 T | and 
7~t = Tare used, with (-)t the adjoint with respect to the Frobenius inner product of matrices. By a similar 
reasoning, 

d Y V = (d Y W)Hf) = -U T f'U. (91) 
Substitution of (90) and (91) into the left-hand side of (85) yields 

M(V) = (X T - YU T )f'U = 0, (92) 

so that the function V given by (86) indeed satisfies the PDE. We will now show that (86) is, in fact, the 
general smooth solution of the PDE (85) under the additional condition dct X 7^ 0, in which case both U and 
W in (87) are nonsingular. To this end, using the ideas of the method of characteristics for conventional PDEs 
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[3,23], we will prove that any smooth function V satisfying the PDE (85) is constant on every connected 
component of the preimage 

W-^S) := {(X,Y) £ R nxn X S n : 

det Y 0, XY~ 1 X T = S} (93) 

of any given nonsingular matrix S 6 S n under the map (X, Y) >-> W, with W- 1 the functional inverse. 
Indeed, let [0, l]9s4 (X, Y) 6 W^ 1 {S) be a smooth curve lying in this set. By differentiating the map 
W along such a curve and using (89), it follows that = W = XU T + UX T — UYU T , which, in view 
of dot U ^ 0, allows Y to be expressed in terms of X as 

Y = U~ 1 X + X T U~ T , (94) 

where ( ) := d s ■ Hence, differentiation of V as a composite function along the curve yields 

V = (d x V,X) + (d Y V,Y) 

= (d x V,X) + {d Y v,u- 1 x + x T u- T ) 

= {d x V + 2U- T d Y V,X), (95) 

where use is made of (94) and the symmetry of the matrix dyV. Since the PDE (85) implies that dxV + 
2U~ T d Y V = 2X- T M(V) = 0, then (95) yields V = 0. Hence, every smooth solution V of (85) is 
constant over any connected component of the set W^ 1 (S) from (93). Indeed, existence of two distinct 
points, which are connected by a smooth curve in W~ 1 (S) and such that V takes different values at these 
endpoints, would contradict the constancy of V along any such curve established above. Thus, V(X, Y) 
can only depend on X and Y through their special combination XY~ 1 X T , and the ODE (94) generates 
characteristic curves on which smooth solutions of the PDE are constant. ■ 

Note that since (86) involves the matrix inverse Y^ 1 , the explicit representation of the solution would 
be hard to guess by treating (85) as a system of scalar PDEs. 

Lemma 6 Suppose f : A„ — ► E is a Frechet differentiable function. Then the function V : R" x n — > K 
defined by 

V(X) := f(XJ X T ), (96) 
with Jo the canonical antisymmetric matrix of order n from (24), satisfies the PDE 

S(X T d x VJ ) = 0. (97) 

Moreover, (96) describes the general solution of (97) among Frechet differentiable functions of X over any 
connected component of the set det X^0. 

Proof Since 

d x {XJ Q X T ) = II, J X T l + IX Jo, JIT, 
then differentiation of (96) as a composite function of X yields 

dxV = -f'X Jo - (J X T f) T = -2f'XJ , 

where we have also used the antisymmetry of the matrix /'. Hence, X T dxVJo = 2X T f'X is antisymmet- 
ric whence (97) follows. Now, to prove the converse, let [0, 1] 3 s >— > X S R nxn be an arbitrary smooth 
curve in the set {X £ R nxn : XJoX T = J?}, where f2 £ A„ is a given nonsingular antisymmetric 
matrix. By differentiating X JqX t along such a curve, it follows that 

X J X T + XJ X T = 0. (98) 

Since det X^0, then the left multiplication of (98) by X^ 1 and right multiplication by X~ T yields 

A(X- 1 XJ ) = 0, (99) 

with A the antisymmetrizer defined by the orthogonal projection onto the subspace A n of real antisymmetric 
matrices of order n as 

A(AT) := N - S(AT) = (AT — N T )/2. 
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In view of (39), the relation (99) is equivalent to the matrix X 1 X being Hamiltonian. Now, if V is an 
arbitrary smooth solution of the PDE (97), then its derivative along the curve is 

V = (d x V,X) = (X T d x VJo,X- 1 XJ ) 
= (S(X T d x VJ ),S(X- 1 XJ )) 

+ (A(X T d x VJ ), Apf-iXJo)) = 0, (100) 

where the Frobenius inner product is partitioned according to the orthogonal decomposition R n x n = §„ © 
A„. In view of (100), the solution V is constant over any connected component of the set where X JoX T is 
a given nonsingular matrix, thus implying the representation (96). ■ 



A.3 General solution 

The following theorem shows that the particular solutions of the PDE (77), obtained in the previous section, 
can be "assembled" into the general solution. 

Theorem 3 Suppose f : R nxn — > R is a Frechet differentiable function. Then the function 

V{X,Y) := f{X{Y~ 1 + J Q )X T ), (101) 

defined for X G R nxn and Y G S n , with det Y ^ 0, satisfies the PDE (77). Moreover, (101) is a general 
smooth solution of the PDE over any connected component of the set {(X,Y) G R nxn x§ n : det(XY) ^ 
0}. 

Proof Since the function (101) can be represented as V = g(XY~ 1 X T , XJqX t ), where g : S n X A n — ► 
M is another Frechet differentiable function given by 

g{a,w) :=f(a + u>), (102) 

then the first claim of the theorem follows from the corresponding statements of Lemmas 5 and 6. The fulfill- 
ment of the PDE (77) for the function (101) can also be verified directly using its partial Frechet derivatives 

d x V = 2(S(f')XY- 1 - A(f)XJ ), (103) 
dyV = ~Y- 1 X T S(f')XY- 1 , (104) 

which follow from the relations d^g = S(f') and d^g = A(/') for the function g in (102). Now, to prove 
that (101) is, in fact, the general solution of the PDE over any connected component of the set dct(XY) ^ 0, 
we employ the transformation (X, Y) h-> (X, W), with W given by (87). This is a diffeomorphism since, 
for any nonsingular X, the matrix Y is uniquely and smoothly recovered from W as Y = X T W~ 1 X. The 
action of the operator M from (78) on the function h(X, W) := V(X, Y) written in the new independent 
variables X and W takes the form 

M(V) = ^X T (d x h + (dxW) t (d w h)) 

+X T W- 1 X(d Y W) t (dwh) = ^X T d x h, (105) 

where the terms containing dw h cancel each other due to the structure of the operators dx W and dy W 
from (88) employed in the proof of Lemma 5. Substitution of (105) into the PDE (77) leads to the PDE 
S(X T dx hJo) = 0. By considering this last PDE for a fixed but otherwise arbitrary nonsingular W G S n , 
and applying Lemma 6, it follows that its general solution over any connected component of the set dct X ^ 
is described by h(X,W) = ip(W,XJ X T ), where ip : S n X A n — y R is a Frechet differentiable 
function. Since any such tp can be identified with a Frechet differentiable function / : R nxn — ► R by 
f(N) = ip(S(N), A(N)), cf. (102), this proves the second claim of Theorem 3. ■ 

Finally, Theorem 1 is obtained by applying Theorem 3 to the minimum cost function Vt(P) for fixed but 
otherwise arbitrary t and Pn, assuming its Frechet smoothness on the set where the blocks of the covariance 
matrix P from (36) satisfy det Pi 2 ^ and P22 y 0. 
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B Special linear operators on matrices 

Following [22], we define, for any matrices a g R sx ? and g R9 xt , a linear operator |o, 0J : R px<3 -)■ 
R sx * by 

IK/3JP0 :=qX/3. (106) 

The generalization of this construct to matrices ct\,...,a r g R sxp and 0\,...,0 r g R 9Xt , with r an 
arbitrary positive integer, leads to a linear operator 

r 

lai,/3i | ... | := X>fc>AJ> ( 107 ) 

fc=l 

where the matrix pairs are separated by "|"s. Of particular importance are self-adjoint linear operators on the 
Hilbert space RP X « of the form (107) where c*i , . .. , a r g W X P and lt ...,0 r 6 R 9X<? are such that for 
any fc = 1, . . . , r, the matrices a fc and /3 fe are either both symmetric or both antisymmetric. Such an operator 
(107) is referred to as a self-adjoint operator of grade r, with the self-adjointness understood in the sense of 
the Frobenius inner product on W x i, so that |a, @p = |a T , /3 T ]]. 

Lemma 7 f227 If a g M pxp and /3 g R 9X ' 3 are io(/i antisymmetric, then the spectrum of |a, /3| i.v 
symmetric about the origin. If a and are both symmetric and positive (semi-) definite, then |a, /3| i.v 
positive (semi-) definite, respectively. 

Whilst the operator (106) with nonsingular a and is straightforwardly invertible, so that |ct, /3J = 
|a _ 1 , 0~ 1 J , the inverse of the operator from ( 1 07) with r > 1 , in general, can only be computed using the 
vectorization of matrices [10, 18] as 

Iai,/3i | ... | a^^r^y) = vcc- 1 ( 7 - 1 vcc(y)), 

provided 7 := Y7k=i &k ® a fc ' s nonsingular. Here, vcc(y) is the vector obtained by writing the columns 
of a matrix Y one underneath the other. 



